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DETERMINING PLACEMENT OF DISTRIBUTED APPLICATION 
ONTO DISTRIBUTED RESOURCE INFRASTRUCTURE 

Related Applications: 

The following applications disclose related subject matter: U.S. Application 
No. (Attorney Docket No. 200208414-1) , filed (on the same day as this application) 
and entitled, "Determination of One or More Variables to Receive Value Changes in 
Local Search Solution of Integer Programming Problem"; and U.S. Application No. 
(Attorney Docket No. 200209180-1) , filed (on the same day as this application) and 
entitled, "Incorporating Constraints and Preferences for Determining Placement of 
Distributed Application onto Distributed Resource Infrastructure"; the contents of all 
of which are hereby incorporated by reference. 

Field of the Invention 

The present invention relates to the field of placing a distributed application 
onto a distributed resource infrastructure. More particularly, the present invention 
relates to the field of placing a distributed application onto a distributed resource 
infrastructure where the distributed application and the distributed resource 
infrastructure have arbitrary communication topologies. 

Background of the Invention 

A distributed application includes a plurality of services. Each of the services 
performs a task or tasks as part of the distributed application. Often the distributed 
application is placed on a network of computers. The network of computers forms a 
distributed resource infrastructure where each of the computers forms a node. 
Performance of the distributed application depends on optimizing a placement of the 
services onto the nodes. 

A first method of the prior art uses parameters for individual nodes to 
determine a placement of the services onto the nodes. Such parameters include 
processing and storage capabilities of the nodes. Services are placed onto the nodes 
so that processing and storage requirements of the services on a particular node do not 
the processing and storage capabilities of the node. 

The first method does not consider relationships among the nodes or among 
the services in the determination of the placement of the services onto the nodes. A 
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second method of the prior art considers topologies between the services and between 
the nodes but requires that the topologies be fixed in certain configurations. The 
second method does not determine a placement of the services onto the nodes where 
an arbitrary topology exists between the nodes or between the services. 

What is needed is a method of determining a placement of services of a 
distributed application onto nodes of a distributed resource infrastructure taking into 
account arbitrary topologies between the nodes and between the services. 

Summary of the Invention 

The present invention is a method of determining a placement of services of a 
distributed application onto nodes of a distributed resource infrastructure. In an 
embodiment of the present invention, the method comprises first, second, and third 
steps. The first step forms communication constraints between node pairs. The 
communication constraints ensure that a sum of transport demands between a 
particular node pair does not exceed a transport capacity between the particular node 
pair. Each term of the sum comprises a product of a first placement variable, a second 
placement variable, and the transport demand between the services associated with the 
first and second placement variables. The second step forms an objective. The 
communication constraints and the objective comprise an integer program. The third 
step employs a local search solution to solve the integer program, which determines 
the placement of the services onto the nodes. 

These and other aspects of the present invention are described in more detail 

herein. 

Brief Description of the Drawings 

The present invention is described with respect to particular exemplary 
embodiments thereof and reference is accordingly made to the drawings in which: 

Figure 1 schematically illustrates a distributed application according to an 
embodiment of the present invention; 

Figure 2 schematically illustrates a distributed resource infrastructure 
according to an embodiment of the present invention; 

Figure 3 illustrates a preferred method of determining a placement of a 
distributed application onto a distributed resource infrastructure as a block diagram 
according to an embodiment of the present invention; 
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Figure 4 illustrates a first alternative method of determining a placement of a 
distributed application onto a distributed resource infrastructure as a block diagram 
according to an embodiment of the present invention; 

Figure 5 schematically illustrates an alternative distributed application 
according to an embodiment of the present invention; 

Figure 6 schematically illustrates an alternative distributed resource 
infrastructure according to an embodiment of the present invention. 

Figure 7 illustrates a local search solution method as a flow chart according to 
an embodiment of the present invention; and 

Figure 8 illustrates a system for determining a placement of a distributed 
application onto a distributed resource infrastructure according to an embodiment of 
the present invention. 

Detailed Description of a Preferred Embodiment 

The present invention determines a placement of a distributed application onto 
a distributed resource infrastructure. The distributed application comprises a plurality 
of services. The distributed resource infrastructure comprises a plurality of nodes. 

A distributed application embodiment is illustrated schematically in Figure 1 . 
The distributed application embodiment 100 comprises a firewall 102, a local buffer 
104, first, second, and third web servers, 106, 108, and 1 10, an application server 1 12, 
and a database 114, each of which forms a service. The firewall 102 is coupled to the 
local buffer 104. Each of the first, second, and third web servers, 106, 108, and 110, 
couples the local buffer 104 to the application server 1 12. The application server 1 12 
is coupled to the database 114. In operation, the distributed application embodiment 
100 provides web access to users who provide and obtain information from the 
database 114. 

A distributed resource infrastructure embodiment is illustrated schematically 
in Figure 2. The distributed resource infrastructure embodiment 200 comprises first 
through eighth nodes, 202. .216, and a switch 218. The switch 218 couples each of the 
first through eighth nodes, 202..216, to remaining nodes of the first through eighth 
nodes, 202..216. The determination of the placement of the distributed application 
embodiment 100 onto the distributed resource infrastructure embodiment 200 is 
accomplished according to an objective such as minimizing network traffic, 
minimizing latency in order to reduce response time, or balancing a load on the nodes. 
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A preferred method of the present invention is illustrated as a block diagram in 
Figure 3. The preferred method 300 comprises first through third steps, 302..306. 
The first step 302 forms communication constraints, which ensure that transport 
demands between node pairs do not exceed transport capacities between the node 
pairs. Each of the communication constraints is made up of a sum of terms. Each of 
the terms comprises a product of a first placement variable, a second placement 
variable, and the transport demand between the services associated with the first and 
second placement variables. The second step 304 forms the objective. In the third 
step 306, a local search solution is employed to solve an integer program comprising 
the communication constraints and the objective, which provides the placement of the 
service onto the nodes. 

A first alternative method of the present invention is illustrated as a block 
diagram in Figure 4. The first alternative method 400 adds fourth and fifth steps to 
the preferred method 300. The fourth step 402 establishes an application model, 
which comprise the transport demands between the services. The fifth step 404 
establishes an infrastructure model, which comprises the transport capacities between 
the nodes. The application model and the infrastructure model provide the transport 
demands and capacities to the preferred method 300. 

An alternative distributed application embodiment is illustrated schematically 
in Figure 5. The alternative distributed application embodiment 500 comprises first 
through fourth services, 501 ..504, and fifth through Sth services, 505. 
Mathematically, the first through Sth services, 501. .505, are expressed as s e {1, 2, 3, 

S}. Each pair of the services has an associated transport demand. For example, a 
first transport demand dti2 represents communication traffic between the first and 
second services, 501 and 502. A transport demand matrix Dt lists the transport 
demands between the first through Sth services, 501. .505, as follows. 

r - dtn dtn .. dtis" 

dt21 - dt23 .. dt2S 
Dt= dt31 dt32 - .. dt3S 

^dtsi dts3 dts3 .. 

Since services do not communicate with themselves over a network, the transport 
demands along a matrix diagonal have no values. Further, depending upon a 
particular implementation it may be sufficient to characterize the transport demands 
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without reference to direction in which case the transport demands below the matrix 
diagonal would also have no values. 

Each of the first through Sth services, 501. .505, of the alternative distributed 
application 500 is also characterized with a processing demand and a storage demand. 
For example, the first service 501 has a first processing demand dpi and a first storage 
demand dsi. A processing demand vector Dp and a storage demand vector D s list the 
processing demands and the storage demands of the first through Sth servers, 
501.. 505, as follows. 



Dp = 



'dpr 

dp2 
dp3 

IdpsJ 



An alternative distributed resource infrastructure is illustrated schematically in 
Figure 6. The alternative distributed resource infrastructure 600 comprises first 
through fourth nodes, 601.. 604, and fifth through Nth nodes, 605. Mathematically, 
the fist through Nth nodes are expressed as n e {1,2, 3, N}. Each pair of the 
nodes has an associated transport capacity. For example, a first transport capacity cti2 
represents communication bandwidth between the first and second nodes, 601 and 
602. A transport capacity matrix Ct lists the transport capacities between the first 
through Nth nodes, 601. .605, as follows. 

Ctl2 Ctl3 .. CtlN^ 
Ct21 — Ct23 .. Ct2N 
Ct= Ct31 Ct32 - .. dt3N 



^CtNl CtN3 CtN3 .. — , 

Since nodes do not communicate with themselves over a network, the transport 
capacities along a matrix diagonal have no values. Further, depending upon a 
particular implementation it may be sufficient to characterize the transport capacities 
without reference to direction in which case the transport capacities below the matrix 
diagonal would also have no values. 

Each of the first through Nth nodes, 601 ..605, of the alternative distributed 
resource infrastructure 600 is also characterized with a processing capacity and a 
storage capacity. For example, the first node 601 has a first processing capacity cpi 
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and a first storage capacity csi . A processing capacity vector Cp and a storage 
capacity vector Cs list the processing capacities and the storage capacities of the first 
through Nth nodes, 601 ..605, as follows. 



Cp = 



{cpu J V CSN , 
In some situations, the distributed application under consideration operates 
solely on the distributed resource infrastructure. In this situation, the transport, 
processing, and storage capacities represent absolute capacities for the nodes. In other 
situations, the distributed application is one of a plurality of distributed applications 
operating on the distributed resource infrastructure. In these other situations, the 
transport, processing, and storage capacities represent available capacities for the 
nodes. 

In the alternative distributed application embodiment 500 and the alternative 
distributed resource infrastructure embodiment 600, the transport and storage 
demands, Dt and Ds, as well as the transport and storage capacity, Ct and Cs, are 
normalized according to standard parameters of data per unit time and data, 
respectively. In an embodiment of the present invention, the processing demand Dp 
and the processing capacity Cp are normalized according to a processing criterion. 
Preferably, the processing criterion is a transaction speed especially when the 
distributed application forms a database application. Alternatively, the processing 
criterion is an algorithm speed. In another embodiment of the present invention, 
various processors are listed in a look-up table with associated processing capacities 
that have been normalized by the processing criterion. In this embodiment, a system 
implementing the present invention would go to the look-up table to find the 
processing capacity for a particular node when needed. 

Applying the first alternative method 400 (Figure 4) to the alternative 
distributed application model 500 and the alternative distributed resource 
infrastructure 600 begins with the fourth and fifth steps, 402 and 404, in which the 
transport demand and capacity matrices, Dt and Ct, are established. The first step 302 
then forms the communication constraints in first and second tasks. The first task 
forms placement variables, which according to an embodiment of the present 
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invention are Boolean variables. In a matrix notation, the placement variables are 
given by a placement variable matrix X where rows represent the services and 
columns represent the nodes. The placement variable matrix X is as follows. 

'xil X12 .. XIN^ 
X21 X22 .. X2N 

x = 

^XSl XS2 .. XSN^ 

The second task forms the communication constraints according to a communication 
constraint equation, which is given as follows. 

Z Z Xij x k i dt ik = ctji 
k i 

The second step 304 forms the objective, which according to an embodiment 
of the present invention minimizes communication traffic between the nodes and 
balances processing loads on the nodes. The latter is accomplished by minimizing the 
mathematical variance of the processing loads. The objective is given as follows. 

Minimize (1 - a) ^ £ Z Pr ° Xl i Z Z Xi J Xkl dtik + 
A i j k i 

«l Z (— Z *u dpo 2 - -J- Z (— Z dpk) 2 } 

j cp, V N V cpi k 

for where a provides a relative weight between minimizing the communication traffic 
and balancing the processing loads, A provides a normalizing factor, Proxij accounts 
for distances between the nodes, and N is a number of the nodes. The third step 306 
then employs the local search solution to solve the integer program comprising the 
communication constraints and the objective. 

Since the communication constraints account for a distributed application 
topology according to the transport demands and for a distributed resource 
infrastructure topology according to the transport capacities, the present invention 
allows arbitrary topologies for both the distributed application and the distributed 
resource infrastructure. This allows the present invention to be used for determining 
placement of applications onto infrastructures in wide variety of situations. Examples 
of such situations include placing an application onto nodes in a data center and 
placing a different application onto nodes in geographically distributed data centers. 

A second alternative method of the present invention adds processing 
constraints to the integer program. The processing constraints ensure that a sum of 
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the processing demands for a specific node does not exceed the processing capacity of 
the specific node. In an embodiment of the present invention, the processing 
constraints are formed according to a processing constraint equation, which is given 
as follows. 

A third alternative method of the present invention adds storage constraints to 
the integer program. The storage constraints ensure that a sum of the storage 
demands for a specific node does not exceed the storage capacity of the specific node. 
In an embodiment of the present invention, the storage constraints are formed 
according to a storage constraint equation, which is given as follows. 
2 Xy dsj < csj 

A fourth alternative method of the present invention adds placement 
constraints to the integer program. The placement constraints ensure that each of the 
services is placed on one and only one of the nodes. In an embodiment of the present 
invention, the placement constraints are formed according to a placement constraint 
equation, which is given as follows. 

A fifth alternative method of the present invention recognizes that, once the 
services have been placed onto the nodes, a rearrangement of the services onto the 
nodes comes at a cost. In the fifth alternative method, reassignment penalties are 
assessed when a service placement differs from an existing assignment of the service. 
According to an embodiment of the fifth alternative method, a second objective is 
added to the integer program. The second objective seeks to minimize the 
reassignment penalties. 

In the present invention, the communication constraints include terms which 
comprise products of two placement variables. Depending on the embodiment of the 
present invention, the objective also includes products of two placement variables. 
Thus, the communication constraints and possibly the objective are quadratic 
equations, i.e., equations having polynomial terms of second order. Solving integer 
programs that include polynomial terms of second or higher order is particularly 
difficult. In an embodiment of the present invention, a local search solution is 
employed according to a local search solution method disclosed in U.S. Patent 
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Application No. (Attorney Docket No. 200308414-1) filed on (the filing date of this 
application), which is incorporated by reference in its entirety. 

An embodiment of the local search solution method is illustrated as a flow 
chart in Figure 7. The local search solution method 700 comprises first through tenth 
solution steps, 702.. 720, in which a gradient following approach iteratively improves 
an initial assignment of values to the variables until a near optimum solution is 
reached. Each iteration of the local search solution produces a new assignment of 
values for the variables. The new assignment of values differs from a previous 
assignment of values by one value for a particular variable. 

The first solution step 702 defines a problem model as an overconstrained 
integer programming problem. In an embodiment of the present invention, the 
problem model comprises data, variables, and constraints. The data comprises the 
processing demands and capacities, the storage demands and capacities, and the 
transport demands and capacities. The variables comprise the placement variables, 
which are Boolean variables where a zero value indicates that a particular service is 
not located on a particular node and where a one value indicates that the particular 
service is located on the particular node. In the overconstrained integer programming 
problem, the constraints comprise hard constraints and at least one soft constraint. 
The hard constraints comprise the processing constraints, the storage constraints, the 
placement constraints, and the storage constraints. The soft constraint is the 
objective, which comprises minimizing the communication traffic between the nodes 
and balancing the processing loads on the nodes. 

In the second solution step 704, the placement variables are randomly 
initialized. The third solution step 706 selects an unsatisfied constraint. The fourth 
solution step 708 creates stores in memory for each of the placement variables in the 
unsatisfied constraint. The fifth solution step 710 parses the unsatisfied constraint by 
term. For each of the placement variables in the term, an associated store is updated 
with a change in the unsatisfied constraint due to flipping the value of the placement 
variable while holding other placement variables constant. In the sixth solution step 
712, the placement variable that is to have its value flipped is chosen according to an 
improvement criterion, such as the placement variable which most improves the 
unsatisfied constraint or the placement variable which most improves an overall 
solution while also improving the unsatisfied constraint. 
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In the seventh solution step 714, assigned values are compared to optimality 
criteria to determine whether a solution has been found. The optimality criteria for 
the overconstrained integer programming problem are no violation of the hard 
constraints and a near optimum solution for the soft constraint. If the optimality 
criteria are not met, the local search solution method 700 continues in the eighth 
solution step 716 with a determination of whether an additional iteration is to be 
performed. If so, the local search solution method 700 returns to the third solution 
step 706 of selecting another unsatisfied constraint to determine another placement 
variable which is to have its value flipped. If not, a ninth solution step 718 determines 
whether to restart the local search solution method 700 by reinitializing the variables. 
If the optimality criteria are met in the seventh solution step 714, a final value 
assignment for the placement variables is output as a result in the tenth solution step 
720. If the ninth step 210 determines to not restart the first alternative method 200, a 
"no solution found" message is output in the tenth solution step 720. 

Preferably, the local search solution method is implemented using AMPL, a 
modeling language for optimization problems. Alternatively, the local search solution 
method 700 is implemented using an alternative modeling language. 

An embodiment of a system for solving an integer program of the present 
invention is illustrated schematically in Figure 8. The system 800 includes a display 
802, input/output devices 804, a processing unit 806, a storage device 808, and a 
memory 810. The processing unit 806 couples to the display 802, the input/output 
devices 804, the storage device 808, and the memory 810. 

Employing the system 800 to solve the integer program according to the local 
search solution method 700 begins with the processing unit 806 reading the problem 
model into the memory 810. The processing unit 806 then initializes the variables by 
randomly assigning values to the variables. Next, the processing unit 806 randomly 
selects the unsatisfied constraint. Following this, the processing unit 806 creates 
stores in the memory 810 for the allowable changes of the variables in the unsatisfied 
constraint. The processing unit 806 then parses the unsatisfied constraint by term 
updating individual stores associated with the term while maintaining other variables 
constant. Following this, the processing unit 806 chooses the placement variable to 
have its value flipped according to the improvement criterion. The processing unit 
806 then determines whether the optimality condition has been met and, if not, 
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determines whether to perform more iterations or whether the local search solution 
method 700 should be restarted. 

In an embodiment of the present invention, computer code resides on a 
computer readable memory, which is read into the system 800 by one of the 
input/output devices 804. Alternatively, the computer readable memory comprises 
the storage device 808 or the memory 810. The computer code provides instructions 
for the processing unit 806 to form the problem model and to solve it according to an 
embodiment of the present invention. The computer readable memory is selected 
from a group including a disk, a tape, a memory chip, or other computer readable 
memory. 

The foregoing detailed description of the present invention is provided for the 
purposes of illustration and is not intended to be exhaustive or to limit the invention to 
the embodiments disclosed. Accordingly, the scope of the present invention is 
defined by the appended claims. 
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